Interconnection network with a shared whiteboard: 
Impact of (a)synchronicity on computing power * 



F. Becker^, A. Kosowski^, N. Nisse^, I. Rapaport^'^, and K. Suchan^'''' 

^ LIFO, Universite d'Orleans, France 
^ LaBRI, INRIA Bordeaux Sud-Ouest, France. 
^ MASCOTTE, INRIA, I3S(CNRS/Univ. Nice Sophia Antipolis), France. 
* Departamento de Ingenierfa Matematica, Universidad de Chile, Santiago, Chile. 
'y-L^'. ^ Centro de Modelamiento Matematico (UMI 2807 CNRS), Universidad de Chile, Santiago, Chile. 

y—^ ' Facultad de Ingenierfa y Ciencias, Universidad Adolfo Ibanez, Santiago, Chile. 

' ^ Faculty of Applied Mathematics, AGH - University of Science and Technology, Cracow, Poland. 

(N : 

(D ■ 

' Abstract. In this work we study the computational power of graph-based models of distributed 

Q,^ , computing in which each node additionally has access to a global whiteboard. A node can read 

■ the contents of the whiteboard and, when activated, can write one message of O(logn) bits on it. 
When the protocol terminates, each node computes the output based on the final contents of the 

I I , whiteboard. We consider several scheduling schemes for nodes, providing a strict ordering of their 

' power in terms of the problems which can be solved with exactly one activation per node. The 

I problems used to separate the models are related to Maximal Independent Set, detection of cycles 

• , of length 4, and BFS spanning tree constructions. 

1 Introduction 

> : 

, A distributed system is a network where nodes correspond to agents or processors and links 

' express the local knowledge of the nodes. To perform any calculation - like deciding some 

' network's property - nodes may exchange information by interacting locally, i.e., with their 

, neighbors. Since nodes lack of global knowledge, new algorithmic and complexity notions arise. 

\ In contrast with classical algorithmic theory - where the Turing machine is the consensus for- 

■ mal model of algorithm - in distributed systems many different models are considered. Under 
• • . the paradigm that communication is much slower and more costly than the local computation, 

\ complexity analysis of distributed algorithms mainly focuses on message passing between the 

^ ' nodes. That is, an important performance measure is the number and the size of messages that 

^ . are exchanged for performing some computation. Some theoretical models were conceived for 

studying particular aspects of protocols such as fault-tolerance, synchronism, locality, conges- 
tion, etc. One of the main questions arising is to determine the global properties of the network 
that can be computed locally. 

In the model Congest [17], a network is represented by a graph whose nodes correspond to 
network processors and edges to inter-processor links. The communication is synchronous and 
occurs in discrete time rounds. In each round, each of the n processors can send a message of 
size O(logn) bits through each of its outgoing links. A restriction of the Congest model has 
been proposed by Grumbach and Wu to study frugal computation [9] . In this model, where the 
total amount of information traversing each link is bounded by O(logn) bits, they showed that 
any first order logic formula can be evaluated in any planar or bounded degree network [9]. 

In [3], Becker et al. investigated a variation of Congest inspired by the Simultaneous 
Message Model defined by Babai, Kimmel and Lokam [2]. In this model, the total amount 
of local information that each node may provide is bounded by O(logn) bits. However, to 
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compensate the little amount of knowledge that is shared, the communication is global: each 
node directly transmits its message to a central authority, the referee, that collects and uses 
them to answer some question about the network. This model allows to abstract away from the 
cost of transmitting data throughout the network, and to look at how much local information 
must be shared in order to compute some property. 

More precisely, each of the n nodes, knowing only its own ID, the IDs of its neighbors and 
the size of the network, is allowed to send one message of O(logra) bits to the referee. Then, the 
referee can use the information it received to answer some question. Becker et al. asked whether 
this small amount of local information provided by each node is sufficient for the referee to 
decide some basic structural properties of the network topology G [3]. For instance, simple 
questions like "Does G contain a square?" or "Is the diameter of G at most 3?" cannot be 
solved. On the other hand, the referee can decode the messages in order to have full knowledge 
of G when G belongs to one of many graph classes such as planar graphs, bounded treewidth 
graphs and, more generally, bounded degeneracy graphs [3]. 

In this paper, we define natural extensions of the model in [3] and investigate the computa- 
tional power of these new models. 

Computations using a shared whiteboard. The computational model in [3] can be stated 
equivalently in the following form. Given a question on the topology of the network, every node 
writes simultaneously one message (computed from its local knowledge) on a global zone of 
shared memory, a whiteboard, and then, each of the nodes must answer the question by using 
only the content of the whiteboard. 

In this paper, we intend to give more power to the initial model of [3]. For this purpose, we 
relax in different ways the simultaneity constraint. Roughly, messages may be written sequen- 
tially on the whiteboard. This allows nodes to compute their message taking into account the 
content of the whiteboard, i.e., the messages, or part of the messages, that have previously been 
written. In other words, in the new models we propose, nodes have extra ways to share infor- 
mation. Basically, the four models we now present aim at describing how the nodes can access 
the shared medium, in particular, differentiating synchronous and asynchronous networks. 

The time is divided into discrete steps and, at every step, each node can see the current 
content of the whiteboard and may perform some local computation according to this content 
and its local knowledge. Along the evolution of the system, the nodes may be in three states: 
awake, active, or terminated. Initially, all nodes are awake. A node becoming active means that 
this node would like to write a message on the whiteboard, metaphorically speaking, it "rises its 
hand to speak" . Finally, a node is in state terminated when its message has been written on the 
whiteboard. During one step, several awake nodes may become active and exactly one active 
node becomes terminated. To model the worst-case behavior of a model, the choice of the node 
that becomes terminated is done by an adversary among the set of active nodes. Note that a 
node may become active and terminated during one step. After the last step, when all nodes 
are terminated, all of them must be able to answer the question by using only the information 
stored on the whiteboard. 

In this setting, we propose several scenarios leading to the definition of four computational 
models. A computational model is said free if, at any step, any awake node may decide to become 
active based on its knowledge and on its own protocol. On the other hand, the model is said 
simultaneous if all nodes are forced to become active during the first step. The other criterion we 
use to distinguish models is the state-transition during which a node must create the message 
it will eventually write on the whiteboard. In the asynchronous scenario, the nodes must create 
their message during the step when they become active. In the synchronous scenario, the nodes 
must create their message during the step when they become terminated. 
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Intuitively, in the asynchronous scenario, a node becoming active must compute its message 
regarding the content of the whiteboard at this step. However, this message is actually written 
on the whiteboard only when the node becomes terminated, which depends on the choice of the 
adversary. Thus, there may be some delay between the creation of a message and the step when 
it is written. In particular, the order in which the messages are created and the order in which 
they are actually available on the whiteboard may differ. In this way, we can model real-world 
asynchronous systems where there are no guarantees on the time of communications. 

In this paper, we combine the free/simultaneous and asynchronous/synchronous scenarios 
and study the four resulting models. In particular, it is easy to check that the simultaneous- 
asynchronous model exactly corresponds to the model studied in [3]. On the other hand, the 
free-synchronous model is inspired by the Multiparty Communication Protocol introduced by 
Chandra, Furst and Lipton [4]. We aim at deciding which kind of problems can be solved in 
different models. Moreover, we intend to show that these models form a hierarchy in which the 
computation power increases strictly. 

Related work. Many variations to the Congest model have been proposed in order to focus 
on different aspects of distributed computing. In a seminal paper, Linial introduced the Local 
model [12, 17]. In the Local model, the restriction on the size of messages is removed so that 
every vertex is allowed to send unbounded size messages in every round. This model focuses on 
the issue of locality in distributed systems, and more precisely on the question " What cannot be 
computed locally?" [11]. Difficult problems like minimum vertex cover and minimum dominating 
set cannot be well approximated when processors can locally exchange arbitrary long messages 
during a bounded number of rounds [11]. 

In the centralized computing point of view, testing a property V oi a graph G may consist of 
determining the minimum number of elementary queries (e.g., 'what is the z*'* neighbor of some 
vertex V?") necessary to decide whether the graph satisfies it. This model was first studied by 
Goldreich et al. [7] (sec [6] for a survey). In this context, probabilistic algorithms are given that 
always accept a graph if it satisfies the property and reject with constant probability any graph 
that is "far enough" from the property. For instance, [1] gives lower and upper bounds for testing 
the triangle-freeness in general graphs. Closer to our model, Goldreich and Ron provide efficient 
algorithms for testing the connectivity of bounded degree graphs when the list of neighbors of 
every vertex is given [8]. 

Other trade-offs between the size of a data structure and the complexity (in terms of the 
number of bits that must be checked in this structure) of algorithms for solving some problems 
have been provided using the communication complexity model and the cell probe model [19, 
20, 13, 14]. Testing graph properties has also been widely investigated using these frameworks 
(e.g., [18,5,16]). 

2 Communication models 
2.1 Protocol formulation 

An interconnection network is modeled by a simple undirected connected n-node graph G = 
iy^E). Each node v ^ V has a unique identifier ID{v) between 1 and n. Typically, V = 
{vi, . . . , Vn}, where Vi is such that ID{vi) = i. Throughout the paper, a graph must be under- 
stood as a labeled graph. 

At each node v eV there is an independent processing unit that knows the local knowledge 
of v: its own identifier, the identifier of each of its neighbors and the total number of nodes n. 
Moreover, any node v & V has a variable output^ with arbitrary initial value and a variable 
status^ G {awake, active, terminated} initially set to awake. 
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Nodes can communicate with each other through a shared memory, caUcd whiteboard and 
denoted by B, that is initiaUy empty. Any node can read B and any active node is allowed to 
write exactly one 0(logn)-bit message on it. 

The time is divided into discrete steps. At any step, each node executes the same algorithm 
A, or protocol, which may be divided into three sub-procedures. For any awake node v, the 
activation function takes the local knowledge of v and the current content of B as input and 
decides to modify the status^ variable to active or to maintain it as awake. For any active node 
V, the message function takes the local knowledge of v and the current content of ;B as input and 
computes a message of O(logn) bits. In particular, we may assume that rriy always contains 
the identifier of v and the number of messages present on B at the step when is created. It 
can serve as a signature with the identity of the author and the "time" of its creation. When n 
messages have been written on B, the decision function takes the (final) content of B as input 
and computes the final value of output^. While some nodes are not in the terminated state, an 
active node v is chosen at the end of each time step and ruy is written on B, and then status^ 
is set to terminated. 

Given a problem^, our goal is to design an algorithm A such that, at the end of the process 
(after the last step), and for any graph G, all nodes agree on the solution of the problem. That 
is, once all nodes have executed the decision function, all variables output^, for any v (z V, 
give the correct answer for the problem. In what follows, to model the worst-case, the choice of 
which active node is chosen is done by an adversary. In this setting, we say that a problem can 
be solved in our model if there exists such an algorithm. It is important to note that, to avoid 
deadlock, a valid algorithm - precisely, its activation procedure - must ensure that, at any step, 
at least one node is active. 

2.2 Scheduling of whiteboard access 

We now specify four models of computation, following the general framework described above, 
that we study in this paper. For this purpose, we propose two additional constraints that may 
be satisfied or not, depending on the model. 

In order to model how nodes access the whiteboard, i.e., in a synchronous way or not, 
we consider the following two variants: either a node must create its message as soon as it 
becomes active, or a node may defer creating its message until it has actually been chosen by 
the adversary to write it on the whiteboard. In the latter case, the node may take advantage 
of all the messages that have been written since it became active. More formally, we impose 
that any node executes the message function only once: in the asynchronous model, it must be 
executed during the same step when it becomes active, while in the synchronous model, it is 
executed when the node becomes terminated. 

The second constraint we consider aims at avoiding an initial deadlock. Indeed, in some 
distributed contexts, it is not possible to decide whether it is better to write some message on 
the empty whiteboard or to wait a first message to be written. To deal with this, we consider 
that either the nodes may be free to decide when to become active, or they may be forced to be 
all active at the beginning. More formally, in the simultaneous model, the activation function 
is imposed to be the function that turns the status variable of each node to active during the 
first step (when the whiteboard is empty). In the free model, the definition of the activation 
function is part of the algorithm's design. 

* Problems we are considering typically consist of a question on the graph's topology and the output of which 

may be a boolean value (decision problems), the adjacency matrix of the graph, etc. However, our models may 
also be used to deal with classical distributed problems like consensus, leader election, etc. 
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message created when node becomes active 


message created when node is chosen 


all nodes initially active 


SimAsync 


SimSync 


no node initially active 


FreeAsync 


FreeSync 



Table 1. Classification of communication models. 



To satisfy or not this pair of constraints leads to four different communication models. We 
now detail the four communication models studied in this work. These models are summarized 
in Table 1. 

Simultaneous Asynchronous (SimAsync). In this model, all nodes become active and create 
their messages at the first step. In other words, all nodes create their messages while the white- 
board is still empty. Hence, the message created by a node v only depends on the local knowledge 
of V and on n. Moreover, the ordering in which the messages are written on B (i.e., in which the 
nodes are chosen by the adversary) is clearly not relevant. This is exactly the communication 
model studied in [3]. 

Simultaneous Synchronous (SimSync). In this model, all nodes become active in the first step. 
However, messages are created just before being written on the whiteboard. In other words, 
the adversary chooses an ordering (xi, . . . ,Xn) of V such that, at step 1 < i < n, vertex Xi 
computes its message according to its local knowledge, the total number of nodes n, and the 
i — 1 messages that have previously been written hy xi, . . . ,Xi-i on B. Note that the ordering 
is not known a priori by the nodes. Hence, an algorithm for solving a problem in this model 
must solve it for any ordering chosen by the adversary. 

Free Asynchronous (FreeAsync). In this model, the nodes decide when to turn active and 
create their messages at the same step that they become active. That is, a node may create a 
message long before being chosen by the adversary to write it down on B. 

Free Synchronous (FreeSync). The nodes decide when to become active and create their 
messages only when they are asked by the adversary to write them on B. 

This paper aims at deciding what kind of problems can be solved in each of these models. 
For instance, [3] proves that deciding if a graph has degeneracy k, k > 1, can be solved in 
SimAsync. On the opposite, deciding whether a graph contains a triangle as a subgraph and 
deciding whether a graph has diameter at most 3 cannot be solved in SimAsync [3]. 

In Section 3, we focus on problems that separate the models, i.e., that can be solved in some 
model but not in another one. In Section 4, we focus on problems related to connectivity. 

First of all, we prove the following lemma that extends a result of [3]. Let BUILD be the 
problem that consists in computing the adjacency matrix of an input graph G. 

Lemma 1. Let Q he a family of n-node labeled graphs, and g{n) he the number of graphs in Q. 
In any of the four considered models, BUILD can he solved in the class Q only if log g{n) = 
0{n logn). 

Proof. Consider any algorithm in one of the four considered models. In any model, at the end 
of the communication process, n messages of size O(logn) bits are written on B. Notice that 
includes the information on the order in which the messages have been created. Hence, at the 
end, a total of O(nlogn) bits are available on the whiteboard. For any node to distinguish two 
different graphs in Q, we must have logg{n) = 0{nlogn). □ 
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For the ease of descriptions, in what follows wc will not define explicitly the functions for 
activation, message creation and decision. Nevertheless, they always will be clear from the 
context. 

3 A strict hierarchy 

In this section, we intend to show that these models form a hierarchy in which the computation 
power strictly increases. Let us formalize this idea. We say that model Y is more powerful than 
model X, denoted by X < y, if every problem that can be solved in X can also be solved in Y. 
Moreover, Y is strictly more powerful than X, denoted by X < y, if X < y and there exists a 
problem V that can be solved in Y but not in X. 

The main result of this section is the following theorem: 

Theorem 1. SimAsync < SimSync < FreeAsync < FreeSync. 

Wc start with the following weaker result: 

Proposition 1. SimAsync < SimSync < FreeAsync < FreeSync. 

Proof. Given two models X,Y, to show that X < y, we consider an algorithm for solving some 
problem in X and show how to turn this algorithm to satisfy requirements of Y. 

- SimAsync < SimSync. It suffices that nodes in the SimSync protocol ignore the messages 
present on the whiteboard when they create their own. 

- SimSync < FreeAsync. Recall that a problem is solved in the SimSync model if the 
nodes compute the output no matter the order chosen by the adversary. So we can translate 
a SimSync protocol into a FreeAsync one if wc fix an order (for instance vi,. . . ,Vn) and 
use this order for a sequential activation of the nodes. 

- FreeAsync < FreeSync. It is the situation of the first inequahty. It suffices to force the 
protocols in FreeSync to create their messages based only on what was known at the 
moment when they became active. 

□ 

3.1 SimAsync vs. SimSync 

We consider here a "rooted" version of the Inclusion Maximal Independent Set problem. 
This problem, denoted by MIS, takes as input an n-node graph G = (V, E) together with an 
identifier ID{x), x E V, and the desired output is any maximal (by inclusion) independent set 

containing x. 

Proposition 2. MIS can be solved in the SimSync model. 

Proof. Recall that in the adversarial model, all nodes are initially active and that the adversary 
chooses the ordering in which the nodes write their messages. Hence, an algorithm in this model 
must specify the message created by a node v, according to the local knowledge of v and the 
messages written on the whiteboard before v is chosen by the adversary. 

The protocol is trivial (it is the greedy one). When node v is chosen by the adversary, the 
message of v is either its own ID (meaning that v belongs to the final independent set) or v 
writes "no" (otherwise). The choice of the message is done as follows. The message is ID{v) 
either \iv = xoT:iiv^ N{x) and ID{y) does not appear on the whiteboard for any y G N{v). 
Otherwise, the message of v is "no". 

Clearly, at the end, the set of vertices with their IDs on the whiteboard consists of an 
inclusion maximal independent set containing x. □ 
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Proposition 3. MIS cannot be solved in the SimAsync model. 

Proof. We proceed by contradiction. Let us assume that there exists a protocol A for solving 
MIS in the SimAsync model. Then we show how to design an algorithm A' to solve the BUILD 
Problem for any graph in this model, contradicting Lemma 1. 

Let G = {V,E) be a graph with V = {vi, ...,?;„}. For any 1 < i < j < n, let C'fj be 
obtained from G by adding a vertex x adjacent to every vertex in V with the exception of Vi 
and Vj. Note that {x,Vi,Vj} is the only inclusion maximal independent set containing x in Gf^j 
if and only if {vi, vj} ^ E. Indeed, if {vi, Vj} G E, there are two inclusion maximal independent 
sets containing x: {x,Vi} and {x,Vj}. 

Recall that, in the SimAsync model, all nodes must create their message initially, i.e., while 
the whiteboard is still empty. Hence, the message created by a node only depends on its local 
knowledge. We denote by A{vk, G^^j) the message created by node Vk following protocol A when 

(x) 

the input graph is Gl ■ . 

Notice that, for a given k, the node Vk can generate only two possible messages .4(ufe,G^^^) 
depending on whether k € or k ^ {hj}- Therefore, we call the message that 

generates when k € {i,j} (i.e., x and Vk are not neighbors) and m'^ the message Vk generates 
when k ^ (i.e., x and Vk are neighbors). 

Prom the previous protocol A we are going to define another protocol A' in the SimAsync 
model which solves the BUILD Problem for any graph. Protocol A' works as follows. Every 
node Vk generates the pair {irik, m'/S) of the two messages v^ would send in A when it is adjacent 
to x and when it is not. Clearly, this consists of O(logn) bits. 

Now let us prove that any node can reconstruct G = (V, E) from the messages generated 
by A'. More precisely, for any 1 < s < t < n, any node can decide whether {vs,vt} E E or 
not. It is enough for any node to simulate de decision function of A in G). / by using messages 
ms,mt and {m^ : fc G {1, • • • ,n} \ {s,t}}. Since the output of A is {x,Vs,vt} if and only if 
{vs,vt} ^ E, the results follows. 

This means that from O(nlogn) bits we can solve BUILD in the class of all graphs - a 
contradiction. □ 

Corollary 1. SimAsync < SimSync. 

We discuss now another problem that could possibly separate the two models. Given an 
(n — l)-regular 2n-node graph G, the 2-Cliques problem consists in deciding whether G is the 
disjoint union of two complete graphs with n vertices or not. 

It is easy to show that 2-Cliques can be solved in the SimSync model. Indeed, a trivial 
protocol can partition the vertices into two cliques numbered and 1 if the input consists of 
two cliques, or otherwise indicate that it is not the case. The first vertex / to be chosen by the 
adversary writes (I-D(/), 0) on B. Then, each time a vertex v is chosen, it writes {ID{v),0) if it 
"believes" to be in the same clique as /, and (ID{v), 1) otherwise. More precisely, let be the 
subset of neighbors of v that have already written a message on the whiteboard. If 5"^, = then 

V writes 1. If all nodes in Sf, have written that they belong to the the same clique c G {0, 1} then 

V writes c, and v writes "no" otherwise. Clearly, G is the disjoint union of two cliques if and 
only if there is no message "no" on the whiteboard at the end of the communication process. 

Proving that 2-Cliques cannot be solved in the SimAsync model is an interesting question 
because it would allow us to show that Connectivity (deciding whether a graph is connected 
or not) cannot be solved in the SimAsync model. Indeed, it is easy to show that an (n — 1)- 
regular 2n-nodc graph is the disjoint union of two cliques if and only if it is not connected. We 
leave this as an open question: 
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Open Problem 1. Can 2- Cliques be solved in the SimAsync model? 
3.2 SimSync vs. FreeAsync 

Let Square be the problem that consists in deciding whether a graph G contains a square 
(induced or not), i.e., whether V{G) contains four vertices a, 6, c and d such that a is adjacent 
to h which is adjacent to c which is adjacent to d which is adjacent to a. 

In [3], it is proved that Square cannot be solved in the SimAsync model. The proof consists 
of showing that if SQUARE could be solved in the SimAsync model, then BUILD could be 
solved, in this model, in the class of square-free graphs, a contradiction. 

Theorem 2. [3] Square cannot he solved in the SimAsync model. 

We extends this result to the SimSync model. More precisely, we show that Square cannot 

be solved in SimSync model even restricted to a specific class of graphs. We then show that, 
in this particular graph class, SQUARE can be solved in the FreeAsync model. Hence, the 
Square problem on this class of graphs separates SimSync and FreeAsync. 

First, let C be the class of graphs of even order N = 2n {n > 1) that can be obtained 
as follows. G e C has the vertex set {vi, ■ ■ ■ ,Vn, Vn+i, • • • , V2n} where G[{vi, • • • , Vn}] induces 
a square-free n-node graph H, and, for any 1 < i < n, vij^n is adjacent to Uj, and finally, 
there is a unique additional edge between Vn+i and for some 1 < i < j < n (i.e., for any 
k € {1, • • • ,n} \ {i,j}, Vn+k has degree one in G). In the following, we note G = {H,i,j). Note 

3/2 3 /2 

that, since there are 12(2" ' ) (labeled) square-free graphs with n nodes [10], \C\ = Q{2^ ) = 
^2(2^'^'). 

Proposition 4. Square cannot he solved in the SimSync model, even when inputs are re- 
stricted to C. 

Proof. For purpose of contradiction, let us assume that there is a protocol V for solving Square 
in C in the SimSync model. We design a protocol P' for solving BUILD in C, contradicting 
Lemma 1 since NlogN = o(log |C|). 

In the SimSync model, the nodes are asked to write their messages in some order. When it 
is the turn of node Vi it computes its message according to the current content of the whiteboard 
and to its own neighborhood. 

Let G = {H, £, k) G C. The key point is that V solves Square in G whatever be the order 
in which the vertices write their messages. In particular, if the vertices in {vi,--- ,Vn} are 
interrogated first, their messages cannot bring any information on the single edge between two 
vertices in {vn+i, • • • , V2n}- Moreover, the vertices in {vn+i, ■ ■ ■ , V2n} have degree at most two 
in G and so can write their full neighborhood. Using these two facts, we design V' that will be 
used to rebuild any graph in C. 

The protocol V' is defined as follows. For any i < 2n, the message created and written by 
node Vi when it is interrogated consists of 

— if n < i < 2n, then Vi writes the identifiers of its at most two neighbors; 

— otherwise, let O be the sequence of the vertices in {f i, • • • , that have been interrogated 
before Vi, in order. Using its local neighborhood and the messages previously written by the 
vertices in O, Vi writes the message it would have written following V and in the ordering 
O. 

We now show that whatever be the ordering O in which the 2n vertices have been interro- 
gated, the final content of the whiteboard allows any vertex to build the adjacency matrix of 
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G. More precisely, we show that any node can decide whether the edge {vi,Vj} G E{G) for any 
I <i < j <2n. 

Clearly, for any 1 < i < n, the edges adjacent to Vn+i can be decided since they appear 
explicitly on the whiteboard (in particular for z = £ and j = k). Let O' be the restriction of O 
to the vertices in {vi, ■ ■ ■ ,Vn}- Now, for any 1 < i < j < n, wc show that, using the information 
on the whiteboard, any node can simulate the protocol V in the graph {H,i,j), against the 
ordering O' {vn+i, • • • ,V2n) and decide whether {vi,Vj} does exist in G which is the case if 
and only if {H,i,j) has a square. 

After the execution of the protocol, the whiteboard contains the messages mi, • • • , nin that 
would have written hy vi, - ■ ■ ,Vn when following protocol V against the ordering C. Using this 
information, any node can compute the message that Vt {n < t < 2n) would have written when 
executing V in {H,i,j). Therefore, it can decide whether {H,i,j) has a square, i.e., whether 
{vi,vj} e E{G). 

Since by Lemma 1 and because of the cardinality of C, no protocol can solve BUILD in C 
and in the SimSync model, we get a contradiction. □ 

Corollary 2. Square cannot be solved in the SimSync model. 

Proposition 5. Square can be solved in C in the FreeAsync model. 

Proof. The protocol is almost trivial. First, any node with identifiant at least n + 1 becomes 
active and creates a message containing its neighborhood (recall that it has at most two neigh- 
bors in G). By reading the n messages that have been written by Vn+i, ■ ■ ■ ,V2n, all remaining 
nodes know the (unique) pair such that {vi+n,Uj+n} € E{G) and know the neighbor u of 
Vn+i, resp., the neighbor v of f j+n, in {vi, ■ ■ ■ , Vn}- Finally, all remaining vertices become active: 
for any k < n, writes an empty message if f ^. ^ {u, v} and u and v write "yes" if they are 
adjacent and "no" otherwise. Clearly, the graph admits a square if and only if the whiteboard 
eventually contains "yes". □ 

Corollary 3. SimSync < FreeAsync. 

Open Problem 2. Can Square be solved in the FreeAsync model, in FreeSync? 
4 Connectivity and related problems 

One of the main questions arising in distributed environment concerns connectivity. For instance, 
one important task in wireless network consists in computing a connected spanning subgraph 
(e.g., a spanning tree) the links of which will be used for communications. In this section, we 
ask in which of our models such problems can be solved. 

We consider the following three problems. By increasing level of difficulty^: the Connec- 
tivity Problem asks if an input graph is connected or not; given an input graph G and an 
input identifier ID{r), the Spanning-Tree Problem requires as output a spanning-trcc of G 
rooted in r if it exists; similarly, the BFS Problem requires a BFS-tree of the input graph G 
rooted in some given node (if G is connected) as an output. 

In [3] the authors conjecture that CONNECTIVITY (and therefore Spanning-Tree) cannot 
be solved in the SimAsync model. This is still an open question. We do not even know whether 
these problems can be solved in the SimSync model. 

Nevertheless, it is clear that Spanning-Tree (and therefore Connectivity ) can be solved 
in the FreeAsync model. The protocol is the greedy one. First the root r becomes active and is 

^ Wc say that a problem V is more difficult than a problem V' if any algorithm for solving V also solves V' , or 
equivalently, any solution for P is a solution for V' 
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chosen by the adversary and writes its ID. Then, at each step, nodes having a neighbor already 
included in the spanning tree become active and choose its parent. 

In the more powerful FreeSync model it is possible not only to construct a spanning tree 
but a BFS tree. Table 2 summarizes the results of this section. 





SimAsync 


SimSync 


FreeAsync 


FreeSync 


BPS 


No 


No 


7 


Ok 






(Prop. 7) 


Ok ill Bipartiti' graphs (Cor. 1) 


(Til. (i) 


Spanning-tree 


7 


? 


Ok (Remark above) 


Ok 


Connectivity 


? 


? 


Ok 


Ok 



Table 2. Various problems related to connectivity and models where they can(not) be solved. 



Proposition 6. BFS can be solved in the FreeSync model. 

Proof. The idea is to simulate phases at the end of which every node in the same layer, i.e., 
at the same distance from vi, knows the ID of its parent. The protocol must let the nodes of 
layer k, i.e., at distance k > 1 from vi, to know the step when all the nodes of layer k — 1 have 
already written their messages on the whiteboard. 

The algorithm for solving BFS is defined as follows. Initially (when the whiteboard is empty), 
only f 1 must become active and writes its ID, its degree and its layer 0. This is Phase 0. 

At Phase i > 0, any node v in layer i becomes active and its message consists of 

1. its own ID; 

2. its layer i; 

3. the ID of its neighbor in layer i — 1 with minimum ID (this neighbor will be considered as 
its parent in the BFS tree); 

4. the number of its neighbors in layer i — 1; 

5. the number by of its neighbors which are not in layer i — 1 + b^ equals the degree of v); 

6. the number of its neighbors in layer i that have already written their messages on the 
whiteboard. 

We now have to prove that every node becomes active at the right phase and that it can 
compute the required information from its local knowledge and what have previously been 

written on the whiteboard. We prove it by induction on i > 1. In particular, we prove that 
all nodes can decide when Phase i ~ 1 terminates and that, at this step, the number ej_i of 
edges between layer i — 1 and layer i can be computed from the information available on the 
whiteboard. 

Phase terminates when vi writes its message which contains its degree, i.e., eo. Then, all 
neighbors of vi (the vertices of layer 1) become active. The vertices of N{vi) can easily compute 
the required information, no matter the ordering the adversary chooses. Note that information 6 
can be obtained because we consider the FreeSync model, i.e., any node can create its message 
at the step when it is chosen (and not when it becomes active). Moreover, all nodes know the 
number of vertices in layer 1 and so can decide when Phase 1 terminates. Finally, ei is exactly 
the sum of by — 2cy among the vertices v in layer 1. Hence, at the end of Phase 1, the induction 
hypothesis is satisfied. 

Assume the induction hypothesis is satisfied at the end of Phase i > 0. Then, any node that 
has not become active yet and that has a neighbor in layer i knows that it belongs to layer 
i + 1 and becomes active. No matter the ordering the adversary chooses, the vertices in layer 
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i + 1 can easily compute the required information since, by the induction hypothesis, all nodes 
in layer i have written their IDs together with the corresponding layer on the whiteboard. 

Moreover, the nodes can detect the end of Phase i + 1 since exactly equals the sum of 
the tty over the vertices v in layer i + 1. Finally, e^+i is exactly the sum of by — 2cy among the 
vertices v in layer i + 1. 

To conclude, since the vertices know n, they can detect when the communication process 
terminates. Moreover, any node can compute a BFS tree because every vertex has written its 
parent ID on the whiteboard. □ 

Corollary 4. BFS can be solved in the FreeAsync model, in the class of bipartite graphs. 

Proof. In a bipartite graph there are no edges between nodes in the same layer, and therefore 
Cy = for every node v. In other words, we need to apply the protocol for the general case 
without computing information 6. 

Proposition 7. BFS cannot be solved in the SimSync model. 

Proof. For purpose of contradiction, let us assume that there is a protocol V for solving BFS 
in the SimSync model. We design a protocol V' for solving BUILD for any n-nodc graph, 
contradicting Lemma 1. 

First, let C be the class of graphs of even order N = 4n — I {n > 1) that can be obtained as 
follows. A graph G = {H, i) € C is built from any n-node graph H with vertex-set {vi, ■ ■ ■ , Vn} 
and any integer i, 1 < i < n, in the following way: let us add 3n vertices {vn+i, ■ ■ ■ , v^n} = {r = 
Vn+i,ai, ■■■ , a„, 6i, • • • , bi--i,bi+i, ■■■ , 6„, ci, ■ • ■ , Ci_i, Cj+i, • • • ,Cn} such that, for any j < n, r 
is adjacent to aj, for any j < n, j ^ i, bj is adjacent to Vj, Cj is adjacent to aj and to bj, and 
finally, Oj is adjacent to Vi. Note that \C\ = i7(2"'^). 

Let G = {H, k) G C. The key point is that V solves BFS in G whatever be the order in which 
the vertices write their messages. In particular, if the vertices in {■yi, • • • ,Vn} are interrogated 
first, their messages cannot bring any information on which of the Oi is adjacent to r. The 
neighborhood of r can easily be encoded with O(logn) bits since it is adjacent to all Oj, i < n. 
Moreover, the vertices in {vn+2,-- - ,V4n} have degree two in G and so can write their full 
neighborhood. Using these two facts, we design V' that will be used to rebuild any graph in C. 

The protocol V' is defined as follows. For any i < An, the message created and written by 
node Vi when it is interrogated consists of 

— ifi = n + l,r writes its neighborhood. 

— ifn + l<i< 4n, then Vi writes the identifiers of its at most two neighbors; 

— otherwise, let O be the sequence of the vertices in {ui, • • • , Vn} that have been interrogated 
before Vj, in order. Using its local neighborhood and the messages previously written by the 
vertices in O, vi writes the message it would have written following V and in the ordering 
O. 

We now show that whatever be the ordering O in which the 4n vertices have been interro- 
gated, the final content of the whiteboard allows any vertex to build the adjacency matrix of 
G. More precisely, we show that any node can decide whether the edge {vi ,Vj} G E{G) for any 
1 < i < j < 2n. 

Clearly, for any 1 < i < 3n, the edges adjacent to i^„+j can be decided since they appear 
explicitly on the whiteboard. Let O' be the restriction of O to the vertices in {vi,--- ,Vn}- 
Now, for any 1 < z < j < n, we show that, using the information on the whiteboard, any node 
can simulate the protocol V in the graph {H,i), against the ordering C {vn+i, ■ ■ ■ ,t'4n)- By 
construction of {H,i), if the edge {vi,Vj} £ E{G) it must belong to the BFS-tree computed by 
V on {H,i). Hence, any node can decide whether {vi,Vj} does exist in G. 
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More precisely, after the execution of the protocol, the whiteboard contains the messages 
mi, • • • , nin that would have written hy vi, - ■ ■ ,Vn when following protocol V against the order- 
ing O'. Using this information, any node can compute the message that vt {n < t < 4n) would 
have written when executing V in {H,i). Therefore, it can decide whether {vi,Vj} belongs to 
the BFS-tree of {H,i), i.e., whether {vi,Vj} G E{G). 

Since by Lemma 1 and because of the cardinality of C, no protocol can solve BUILD in C 
and in the SimSync model, we get a contradiction. □ 

Open Problem 3. Is it true that FreeAsync < FreeSync? We conjecture that this is the 
case and that in fact BFS cannot be solved in the FreeAsync model. 

5 Conclusion and perspectives 

We have investigated four models of distributed computing, extending the results presented 
in [3]. The definitions of these models arc based on some intuitive conditions related to syn- 
chronicity, and we have seen that imposing them on the system has significant impact on what 
problems can be solved. We proved that there exists a hierarchy of non-decreasing computing 
power between these models. Moreover, wc proved that in two cases the power strictly increases 
and left an open problem to check if it also holds in the third case. 

We have analyzed several problems related to independence, cyclicity and connectivity. For 
these problems, we ask what are the conditions that a distributed computational model re- 
quires to solve them. Motivated by applications in routing [15], we payed special attention to 
connectivity: we analyzed the complexity of general spanning tree and BFS-tree construction. 
In particular, we showed that BFS-tree construction cannot be solved if nodes are not free to 
decide when they activate. On the other hand, the problem can be solved under the additional 
condition of synchronicity. The necessity of synchronicity for BFS-tree construction is left as an 
open problem. 

We only analyzed the systems where no faults are allowed. It would be interesting to see 
what can be done when the system admits computation or communication errors. Another 
possible direction is to further the analysis of construction problems for connected spanning 
subgraphs that satisfy properties needed in compact routing protocols (see [15]). 
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